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SENTENCE RE::ONSTRUCTION USING 
WORD AMR TGUTTY RESOLUTION 

This invention relates to resolution of word ambiguity 
in the context of a sentence structure having a plurality of 
word choices for at least one word position. More 
particularly, sentences entered by use of a three letter per 
key telephone keypad are reconstructed using syntaxrbased 
word use rule sets reflecting word sequence and word 
category positioning usage in a selected language. 



BACKGROUND OF THF TMVFMTTnfJ 

In entry of text by use of a standard typewriter form 
of keyboard each key stroke represents a single letter on a 
non-ambiguous basis. For many applications it is desirable 
to be able to enter text by use of a readily available 
telephone having a standard telephone keypad. For example, 
such text entry would enable an untrained caller to 
communicate with a hearing-impaired person from any 
available telephone. However, telephone keypads are 
overloaded with three letters to a single key which creates 
ambiguities as to which letter is intended by a single key 
stroke. Prior approaches toward resolving the ambiguities 
have typically required two or more key strokes for entry of 
each letter. Resulting text entry routines have been 
cumbersome, slow and frustrating to use. 

The marriage of computing and telecommunications has 




spawned an amS^ing variety of new services and capabilities. 
Many individuals are currently excluded from p irticipation 
because they do not have ready access to an alphanumeric" 
terminal and modem. However, most' homes in many countries 
already contain a simple on-line terminal in the -form of a 
telephone. Anyone who has ever placed a telephone call is 
familiar with the fact that the 2-9 keys of a standard phone 
dial are each labeled with three letters of the alphabet 
(the letters Q and Z do not appear) . These labels have been 
used to provide mnemonics to aid in memorizing phone numbers 
(such as 1-800-FLOWERS) and in modern messaging systems for 
entering single word strings (such as a database of employee 
names in a company switchboard application) . However, the 
overloading of three letters on a single key creates a 
potential ambiguity as to which character was intended. 

Resolution of this overloading ambiguity is necessary 
for unambiguous text entry. For applications involving a 
small, predetermined vocabulary (such as the switchboard 
application noted above) , a simple hashing scheme will 
suffice. However the problem becomes considerably more 
difficult if arbitrary text is to be entered (i.e., text 
entry free of predetermined constraints on word usage) . 

A variety of different text entry conventions have been - 
proposed for use with telephone keypad to resolve or reduce 
ambiguities resulting from the multi-letter keys, without 
attempting to address all prior techniques, it can be noted 
that numerous prior approaches have addressed ambiguity on a 
letter level. For example, "L" is the third character on 
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the number 5 Icey and an unambiguous entry can be made by 
pressing the 5 key three times, followed by pressing the O 
key to indicate completion of entry of the third letter on 
the 5 key. Alternatively, L can be entered by pressing the 
5 5 key followed by another key predesignated to indicate the 
third letter on the 5 key. In each case the letter L is 
unambiguously entered, but a multi-stroke procedure must be 
learned and then carried out for each letter. 

Other approaches have addressed ambiguity resolution at 
10 a v;ord level. Typically, if three keys are activated to 
represent a three letter word, for example, a dictionary 
3 type word list can be accessed to identify all English words 

fA formable by the permutations of the three letters 

==5 represented in each of the three successive letter 

l2 15 positions. One approach is to limit word possibilities to a 
restricted list of words, names or locations so that it is 
predetermined that only one word can be represented by that 
specific three key seguence. A word level approach is 
discussed in the article, "Let Your Fingers Do the Spelling: 
20 Disambiguating Words Spelled With the Telephone Keypad", J. 
R. Davis, AVIOS Journal, 9:57-66, March 1991. Unambiguous 
operation with a limited word list of streets, words or 
people is discussed. This article specifically acknowledges- 
the absence of any capability to go beyond a word level 
25 operation constrained by a predefined limited word list, 
concluding that: "Implicit disambiguation does not appear 
feasible for entering unrestricted English text". " 

A word level approach which permits less restricted 
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text entry is oescribed in U.S. patent No. 4,650,927, issued 
March 17, 1987. This patent recognizes the basic inability 
to choose the correct word when the text entry via a 
telephone keypad results in a list' of two or more word 
5 possibilities for a particular word position. Instead of 
reducing word ambiguity by limiting the list of acceptable 
words, this patent presents the list of possible words to 
the human recipient of a message. Us-ers presented with such 
word choices must then use their own judgment in selecting 

10 the word which makes the most sense to them in the 
sequential context presented to them. 

Basic shortcomings of all known prior approaches are 
that they have been either slow and cumbersome if operating 
on a letter level of ambiguity resolution, or limited in 

15 overall capability and performance if operating on a word 

level of ambiguity resolution. No known prior approach has 
provided a practical capability of ambiguity resolution on a 
sentence level utilizing syntax relationships regarding 
usage of words in a selected language. 

20 Objects of the present invention are, therefore, to 

provide new and improved sentence reconstruction methods and 
systems having one or more of the following capabilities: 

- resolution of word ambiguities based on syntax 
relationships regarding word usage; 

25 - unrestricted text entry via a t-elephone keypad; 

- word ambiguity resolution based upon probabilities of 
word usage and probabilities of one or both of - 
particular word sequences and positioning of words of 
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particular word categories; and 
- simple text entry protocol. 



SUMMARY OF THE TMVKNTTOM 

In accordance with the invention, a sentence 
reconstruction method, for resolving word ambiguities in a 
selected language sentence structure entered using single 
stroke activation of a key set including text entry keys 
each representing a group of letters, includes the following 
steps: 

(a) providing first key stroke indicia each 
ambiguously representing a letter of a group of letters and 
second key stroke indicia including indicia representing 
spaces between words; 

(b) partitioning (by use of the second key stroke 
indicia) the first key stroke indicia into a sequence of 
word positions, each word position comprising a code block 
represented by at least one of the first key stroke indicia; 

(c) accessing a database including a word list to 
identify for an individual word position a word group 
including alternative word choices formable from the letter 
groups represented by the code block for such word position; - 

(d) repeating step (c) for the sequence of word 
positions to identify a corresponding word group including 
at least one word choice for each of a plurality of word 
positions; 

(e) utilizing a stored word use rule set 
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representative of relative frequency of particular word 
usage in the selected language to derive, for the word group 
for one of the word positions, probability values for word 
choices for that word position; 

(f ) utilizing a stored language rule set 
representative of usage in the selected language to derive 
probability values for a sequencing of individual word 
choices for the word position relative to at least one word 
choice for an adjacent word position in the sentence 
structure, said language rule set including rules in the 
following categories (i) rules based on transitional 
probability of use of particular word sequences, and (ii) 
rules based on probability of relative positioning of words 
of particular word categories in a sentence structure; 
i5 (g) repeating steps (e) and (f) for any additional 

word positions having an associated word group including a 
plurality of alternative word choices; and 

(h) selecting, by use of the probability values 
derived in steps (e) and (f ) , one word from each word group 
20 for inclusion at a respective word position in a 
reconstructed sentence structure. 

In a preferred embodiment of the sentence 
reconstruction method, in step (a) the second key stroke 
indicia additionally includes indicia representing a period 
25 delimiting the sentence structure arid step (b) additionally 
includes partitioning the word positions into at least one 
sentence. Between steps (c) and (d) an additional step may 
also be included as follows: 
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(x) for each word position for which no word group 
including a:: least one word is identified in step (c) 
utilizing a stored word assembler to attempt to identify at 
least one of a suffix construction', a prefix construction 
and a combination word construction, and to thereb>y identify 
a word group including at least one word for the word 
position. 

The sentence reconstruction method may additionally include 
the following step: 

(j) using the words selected in step (h) to provide a 
reconstruction of the entered sentence structure in at least 
one of the following forms: a viewable display, a printout, 
a synthesized voice output. 

For a better understanding of the invention, together 
15 with other and further objects, reference is made to the 

accompanying drawings and the scope of the invention will be 
pointed out in the accompanying claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block diagram of an embodiment of a 
sentence reconstruction system in accordance with the 
invention. 

Fig. 2 shows a telephone keypad. 

Fig. 3 illustrates phases of a 'Sentence reconstruction 
method using the invention. 
25 Fig. 4 illustrates a blank resolution technique usable 

in the Fig. 3 method. 
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Fig. 5 inustrates word ambiguities resulting from text 
entry via a telephone keypad. 

Fig. 6 illustrates telephone keypad code strings with 
greatest ambiguities. 

Fig. 7 is a trellis diagram useful in describing 
operation of the invention. 

Fig. 8 shows samples of word tags used in accordance 
with the invention. 

Fig. 9 is useful in considering a performance analysis. 

Fig. 10 is useful in describing results achieved in use 
of the invention. 

Fig. 11 is a sample of reconstructed sentence structure 
provided by operation of the invention. 

Fig. 12 is a flow diagram useful in describing a 
sentence reconstruction method in accordance with the 
invention. 



DESCRIPTION OF THE INVENTION 



Fig. 1 illustrates an embodiment of a sentence 
reconstruction system in accordance with the invention. 
Following a brief - description of the Fig. 1 system, theory 
and operation of a sentence reconstruction method using the 
invention will be described in greater detail. 

As shown, the Fig. 1 system includes an entry device, 
shown as telephone 10 having a keypad as represented in Fig. 
2. When a user presses a three-letter key, such as "one of 
keys 2-9 in Fig. 2, the entry device 10 provides a first key 



stroke indicia in the form of a telephone tone signal which 
can be transmitted via a telephone system, receive*: and 
stored in memory (e.g., in digital form) in known nanner. 
As will be discussed further, second key stroke indicia can 
also be provided, as by pressing the asterisk key- to 
indicate a blank character (e.g., a space between words) or 
by pressing the pound key to indicate a sentence limit or 
termination. The entry device 10 is tihus usable to enter 
text in the form of one or a series of sentence structures. 
For present purposes, "a sentence structure" is defined as a 
complete sentence, a sentence fragment, a phrase, a clause 
or other sequence of words put together in view of one or 
more syntax considerations based on word usage in a selected 
language, such as English. 

A processor, shown as CPU 12 which may comprise a 
microprocessor, is responsive to entry device 10 to cause 
the first and second key stroke indicia, coupled via input 
terminal 11, to be stored in memory unit 14. It will be 
appreciated that entry device 10 may typically be any 
telephone of a type having a keypad for providing standard 
tone signals and may be at any convenient remote or other 
location. Different telephones may be connected at 
different times and may have touch panels or other 
arrangements permitting text entry. As shown, memory unit 
14, which may be any appropriate type or form of storage 
device or devices, includes a data memory portion 16 and a 
program memory portion 18. Data memory portion 16 includes 
both database storage as will be described and text storage 



suitable for s^rage and retrieval of first and second key 
stroke indicia resulting from entry of one or a series of 
sentence structures via telephone 10. 

The database storage of data memory portion 16 is 
arranged in accordance with an embodiment of the invention 
to store the following. A stored word list (which may 
include on the order of 50,000 words representative of all 
relatively commonly used English words) is usable to 
identify for an individual word position a word group 
including all alternative word choices formable by 
permutation of the letter groups represented by a code block 
of the letters carried by the keys activated for entry of a 
particular word. A stored word use rule set is arranged to 
be representative of relative frequency of particular word 
usage in the selected language in order to enable derivation 
of probability values for word choices in a sentence 
structure. A stored language rule set representative of 
usage in the selected language enables derivation of 
probability values for word sequencing (e.g., for sequencing 
of individual word choices or candidates for a given word 
position, relative to a word choice for an adjacent word 
position in a sentence structure) . The language rule set 
includes rules in the following categories (i) rules based 
on transitional probability of use of particular word 
sequences, and (ii) rules based on probability of 
positioning of words of particular word categories in a 
sentence structure. Thus, category (i) may provide 'the 
relative probability that two particular words will be used 
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in sequence in common usage in the English language. 
Category (ii) may, for example, provide the probability that 
a word categorized as a noun will appear at the end of a 
sentence. The nature of these rule sets will be further 
5 described. 

The program memory portion 18 of the memory unit 14 is 
arranged in accordance with this embodiment of the invention 
to comprise the following which may be considered as program 
units usable with the CPU 12 to practice the invention, A 
10 partitioning unit 18a is responsive to the second key stroke 
indicia to partition the first key stroke indicia, as stored 
.^g in memory portion 16, into a sequence of word positions. 

\q Each word position will thus comprise a code block 

q represented by at least one of the first key stroke indicia 

il 15 (e.g., indicia of activation of the ghi key). An access 
m unit 18b is responsive to the code block for each word 

f| position to access the word list stored in data memory 

^l^ portion 16 to identify for an individual word position a 

word group including alternative word choices formable from 
20 the letter groups represented by the code block. It will be 
appreciated that for some code blocks the letter group will 
uniquely represent only one word (in the simplest case, the 
abc letter group represents only the word "a") . However, it - 
is most likely at least one code block of a sentence 
25 structure will provide letter groups'" formable into a 

plurality of words, presenting a word ambiguity addressable 
by the invention. A word probability unit 18c stored in 
program memory portion 18 is arranged to utilize the word 
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use rule set Scored in data memory portion 16 to derive, for 
the word group for each word position, probability values 
for word choices for each of the word positions (e.g., based 
on frequency of occurrence in common English usage) . A 
5 syntax unit I8d is arranged to utilize the rules .of 

categories (i) and (ii) of the language rule set stored in 
data memory portion 16. The syntax unit is thus enabled to 
derive probability values for a sequencing of alternative 
word choices for individual word positions relative to word 
10 choices for adjacent word positions in the sentence 

structure (e.g., based on two word sequences and word 
category positioning as discussed) . A selection unit 18e is 
arranged to utilize the probability values derived in units 
18c and 18d to select one word choice from the word group 
N 15 for each word position for inclusion at the respective word 
Q position in the sentence structure. As will be described 

□ further, the word choices selected for inclusion in the 

^ reconstructed sentence structure will typically represent 

the word choices associated with the highest probability 
20 values or costs as derived. Program memory portion 18 may 
also store a word assembler unit 18f for identification of 
suffix construction, prefix construction and combination 
word construction with respect to word positions for which 
access unit 18b is unable to identify any word choice from 
25 the w6rd list which corresponds to the letter groups 
represented by the code block for the respective word 
position. 

The Fig. 1 sentence reconstruction system further 
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includes an output device to provide a representation of the 
entered sentence structure (in the form of the sentence 
structure as reconstructed) for use by an individual 
utilizing the system. As illustrated, the output device may 
comprise a display. 20 providing a viewable display of the 
sentence structure, a printer 22 providing a hard copy 
printout, a speaker 24 providing a synthesized speech 
output, or other appropriate device or combination thereof 
suitable for making the sentence structure available in a 
human perceivable format. The invention has been described 
in the context of a single sentence structure, such as a 
complete sentence, which may be a portion of more extensive 
text including a series of sentence structures. 
BASIC OPERATION 

As background to a description of operation, 
considerations relevant to the bounds of achievable text 
reconstruction under information theory constraints will be 
briefly addressed. Before attempting to reconstruct English 
text from single-key phone codes, it should be established 
that there is enough information available to make such a 
reconstruction possible. Formally, an information source S 
with a repository- of possible messages Mj^, 1 < k < n selects 
one message for transmission. Information theory defines 
the information content or source entropy H(S) of a set of 
messages in terms of the probability" P^'M^; that this message 
is chosen by the source, as the average value of information 
associated with each of the messages as represented in the 
following equation. 



H(S) = -Y, P iM.) log ^P(M.) 

i = l . 



The source entropy H(S) denotes the average information 
content conveyed per message, in units of bits. The source 
entropy reaches its maximum for the case of equally likely 
messages, when H(S) = log2 n. 

For present purposes, the message source is the English 
text itself and measuring the true entropy of English is a 
non-trivial problem. If one imagines the message source to 
consist of 26 letters, the space and the period, each of 
which are used with equal probability, then the source 
entropy is log228 =4.81 bits per letter. Since the 
information content which can be transmitted by using the 
phone keypad is at most Iog2l0 = 3.32 bits per keystroke, 
there remains a considerable deficit which must be overcome. 

English, however, does not use all letters with an 
equal probability. Using the probabilities from the Brown 
Corpus (as provided in Frequency Analysis of English Usage: 
Lexicon and Grammar, edited by W.N. Francis, Houghton 
Mifflin, Boston, 1982) the source entropy of first-order 
English drops to 4.16 bits per letter. However, standard 
assignment of letters to phone keys is not optimal and 
lowers the transmitted entropy to approximately 3 . 07 bits 
per letter, leaving a deficit of over one bit per symbol. A 
better measure of the entropy of English comes from 
considering entire words or sentences as messages, rather 
than the individual letters. On a word-by-word basis. 
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estimates of the entropy of English drop to below 2.2 bits 
per letter (see C.E, Shannon, "Predic :ion and Entropy of 
Printed English", Bell System Technical Journal , 30:50-64, 
1951) . In a series of experiments' on full English text, 
Shannon obtained an estimate of the entropy of English as 
between 0.6 and 1.3 bits per letter. Since the information 
capacity of telephone keycodes exceeds Shannon's estimate of 
entropy, in principle it is possible t:o accurately 
reconstruct the source text by exploiting context and other 
linguistical constraints. 

A reconstruction system using the present invention 
takes as input a stream of digits, and outputs a maximum- 
likelihood reconstructed English source text. The following 
input convention is used in a preferred embodiment. To 
supplement the traditionally-labeled keys 2-9, the asterisk 
key * is used for the letters Q, Z and )6, the blank 
character. The pound key # serves as sentence delimiter. 
The 0 and 1 keys have no meaning and could be used for 
additional features. Hyphens and other punctuation are 
omitted or substituted by blank characters. Ordinal and 
cardinal numbers are entered by spelling the numbers out in 
this exemplary convention which ends each sentence with a #. 
The reconstruction proceeds on a sentence-by-sentence basis - 
in four phases as represented in Fig. 3. 

Initially, blanks are identified by use of partitioning 
unit 18a stored in program memory portion 18. The first 
phase partitions the input stream into word positions . or 
tokens by recognizing the blank characters, and is rendered 
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non-trivial because of overloading the * key with Q and Z in 
the chosen input convention. Since blanks are much more 
common than Q and Z, simply marking all asterisks as blanK 
would be correct 98.99% of the time. However, a better job 
of identifying the- real blanks can be done. 

The decision as to whether an asterisk represents a 
blank depends on the character context of the asterisk. A 
window of three characters preceding and following the 
asterisk is considered as context, as in Fig. 4. These pre- 
asterisk and post-asterisk trigrams are composed of phone 
codes, resulting in 10^ possible combinations for each 
trigram. For each trigram, there is tabulated how often the 
asterisk was intended as Q, Z, or blank, yielding six 
quantities: 

• T^^: How often did the letter c G {Q,Z,J6} occur as the 
fourth character of the pre-asterisk trigram? 

• T2^: How often did the letter c e {Q,Z,)6} occur as the 
first character of the post-asterisk trigram? 

The blank confidence is defined as represented by the 
following equation. 

-C(*-iJ) = — 1 ? 

The asterisks which are classified with high confidence as ^ 
are now so labeled, while other asterisks remain 
unidentified until Jater phases of our algorithm. In 
setting the confidence threshold C (* - Y>) = 1.0, 81.63% of 
all asterisks are marked as )6 while making no mistakes. 
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Using lower COTreidence thresholds to claSsify the asterisks 
significantly increases the number of blanks correctly 
marked, at a cost of a small number of errors. The best 
accuracy for reconstructing asterisks is obtained for a 
marking threshold of 99,97%, The overall blank recognition 
accuracy is then 99.91%. Of course, if the text input 
convention were changed to use the asterisk key only for 
blank identification this accuracy fiugure would be 100%. 

Word groups are then identified by use of word list 
access unit 18b stored in program memory portion IB I For 
each token or word position there is identified a word group 
of alternative word choices or candidates formable from the 
letter groups represented by a respective code block 
reflecting one or more keystrokes. The word group is 
identified by accessing a stored dictionary-type word list 
of English words and determining which words hash to the key 
codes of the token. 

Unknown words are addressed in word assembler unit 18 f 
stored in program memory portion 18 . If no dictionary word 
entry exists for the token, a character interpretation is 
constructed using partial dictionary matches, affix/suffix 
analysis, and by. optimizing over character probabilities 
and transitional letter probabilities. 

Sentence reconstruction can now proceed. A sentence is 
composed of a list of tokens termina-ted with a period. Each 
token is now labeled with a list of one or more character 
interpretations, defining a set of possible sentence- 
interpretations. A dynamic programming algorithm is applied 
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to eliminate candidates which are unlikely because of word- 
pair probabilities and grammatical constraints 
probabilities, and returns the most likely word sequence. 
DETAILED OPERATION- 

Aspects of the system operation will be considered in 
greater detail. The blank recognition process partitions 
the input stream into sequences of code strings or code 
blocks. Each code sequence represents a small number of 
consecutive English words, separated by the remaining 
unlabeled asterisks. 

The dictionary matching phase seeks to map each code 
sequence with a unique word in a word list dictionary of 
nearly 50,000 words. Most of the words in the dictionary 
map to unique phone code sequences, however, approximately 
16% of the phone codes collide with other dictionary words. 
The distribution of collisions is shown in Fig. 5. As an 
extreme case, the code string "22737" collides with eleven 
distinct English words, as shown in Fig. 6. Designing a 
good word list involves tradeoffs. All common words must be 
in the word list, although adding very seldom-used words 
might decrease the overall accuracy due to increased 
collisions with other words. The word list was built from 
an available word list enhanced by other common words. 
Abbreviations such as BMW and ACM were removed, yielding a 
word list with 53324 words hashed to 44474 code entries. 
The lookup succeeds with the chosen word list for about 80% 
of all tokens in unrestricted text entry. 

For the case of tokens that can not be resolved in the 



previous phase a series of further reconstruction attempts 
is-made by use of a word assembler unit 18f stored in memory 
portion 18, If the dictionary lookup fails, one reason 
might be that the code contains more than one English word. 
5 Thus an attempt is made to find a dictionary entry for a 
segment of the code which is delimited by an asterisk and 
starts at character position If the attempt succeeds the 
code string is split at the asterisk position and a new 
token is created for the rest of the string. Single letters 
10 are not split off. For the new token with the code 

remainder the process restarts. Many tokens contain only 
e5 two English words, and would be found in the dictionary had 

Q the separating asterisk had been interpreted as blank. If 

the split fails, the code string is checked for possible 
l::^^ 15 affixes. Suffixes and prefixes which are found in the code 
% are stripped off, and the remaining string is looked up in 

^ the dictionary. If multiple entries exist, the most 

^ frequent entry is used. If no entry is found, the affix is 

discarded and the code is passed to the next subphase. 
20 For words which are completely unknown to the 

dictionary, such as proper names and foreign terms, a 
recursive match procedure is used. The decode suggestion is 
assembled from partial matches of the code and dictionary 
entries. The final candidate is chosen from all possible 
25 matches according to the following preference list: 

(1) The candidate with the most reconstructed characters 
wins • 

(2) The candidate with the fewest partial matches wins. 
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(3) The candraate with the most preservea word boundaries 

wins, i.e., asterisks which turn to blanks. 
This match routine is not able to reconstruct certain kinds 
of words that can not be constructed from English words, 
5 such as foreign names. To catch these cases a rejecting 

rule is applied: a string is accepted if it consists of at 
most two more partial matches than the number of contained 
blanks. Otherwise it is rejected. 

For the remaining unknown words a dynamic programming 

10 algorithm is applied. A second-order transitional 

probability model is used, based on the probability of 
observing letter given previous letters L±-2^±-i, as 
estimated using the Brown Corpus. The Viterbi algorithm is 
a known form of dynamic programming algorithm for finding a 

15 global optimum for the given probabilities. It will be 
further described below, applied to words of a sentence 
rather than letters of a word. 

The input for proceeding with sentence reconstruction 
is thus a list of tokens corresponding to a sentence of the 

20 text input • Each token or word position contains a word 
group including a list of one or more possible 
interpretations tor its code block. Depending on the size 
of these interpretation lists, there may be an exponential 
number of possible sentence candidates. Grammatical 

25 constraints are employed within the- -Viterbi algorithm to 
eliminate unlikely or implausible candidates, to find the 
most likely sentence for the phone code string. The Viterbi 
algorithm is employed as follows. Let X = XqK^^. . .X^^^^ 
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denote a given sequence of phone code tokens, where the 
are the isolated word codes and Xq and X^^^ the sentence 

delimiters. Further, let Z = ZqZj^...Z^^^j^ be an English^ 
sentence consistent with X, where 2^ and Z^^^ are periods. 
The Viterbi algorithm seeks the interpretation Z that 
maximizes P(Z/X) , the probability that Z caused X. It takes 
into account: 

P(Z±/X^) , the probability that the word Z^ was intended 
when the code was observed; and 

P(Zj^/Zj^_j^) , the transitional probability that the word 
Z^ is observed when the previous word was Z^_j^. 
A directed graph or trellis as illustrated in Fig. 7 
represents the word lattice that forms the set of sentence 
candidates. Each of the nodes and edges have values or 
costs assigned to them. The node costs are the word 
probabilities for the observed code, while the edge costs 
are the transition probabilities between word pairs. Any 
path between the start and end nodes represents a possible 
sentence. The Viterbi algorithm was used to find the 
maximum-cost path in the trellis using dynamic programming. 
A detailed description of the Viterbi algorithm is provided 
in "Experiments in Text Recognition With Binary N-gram and 
Viterbi Algorithms", IEEE Trans. Pattern Analysis, ^nd 
Machine Intell i gpnr:e, PAMI-4 : 520-53 0 , 1982. With an 
understanding of the invention, skilied persons will also be 
enabled to use hidden-Markov-models, neural networks or 
other analysis approaches in implementation of the - 
invention. 



To compute the word transition probabilities, all word 
pairs in tne Brown Corpus were examined. Each entry in the 
word list dictionary contains the list of words which follow 
it, and the corresponding transitional probabilities. To 
5 account for word pairs which did not occur in the corpus, 

= 1/count^ is used to estimate the probability of 
word f^j^^i given W^, which itself occurred county times in the 
corpus. Since the transition probability is at least 
P{W^ /W.) = l/countPi^. if word pair W^. W. occurs in the 
10 corpus, this gives preference to the pair which actually 
appeared. However, W^^^^ might have a larger stand-alone 
frequency than W^^^, and thus lead to a more costly or 
higher value path and hence better interpretation. 

The Viterbi algorithm takes word and word-pair 
15 frequencies into account, but no grammar rules are 
considered. Thus the selected sentence may not be 
grammatically plausible. To ensure implausible sentences 
are rejected, each word in the dictionary is augmented with 
a tag that indicates the corresponding grammatical word 
20 category. The Brpwn Corpus contains about 30 different word 
tags, which again might be combined with each other for word 
categories. Examples of word tags are shown in Fig. 8. A' 
total -of 132 combinations were used^.each of which became a 
word category in the underlying dictionary. The words that 
25 appear with more than one category became multiple entries 

in the dictionary each with a different tag and with its own 
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Word-pair f reauencies 



frequency cpuTrc. Word-pair frequencies were collected for 
the words in regard to their v ord category tags. 

A corpus was created sol* ly of tags and the period. 
From that collection of sentences, ' bigram (word pair) 
frequencies for the tags were collected. From that data, 
transitional probabilities for the tags were derived, to be 
taken into account by the Viterbi algorithm as additional 
edge cost probability values. The ward and word-pair 
frequency costs of the standard Viterbi algorithm were 
combined with the tagged costs. For some constant 
0 ^ c ^ 1, Costp^th (^"^) * Cost^tand-^^ * Cost^^^^ trades off 
the grammatical categories of the word nodes with their 
transitional probabilities. After experimentation, c = 0.4 
was found to give the best results. The Brown Corpus is the 
source for the word categories. The augmented dictionary 
contained words which did not appear in the corpus, and were 
hence untagged, effecting performance. To reduce the amount 
of words without tags, some tagging "^by hand' was performed 
on the dictionary. 

A performance analysis of results achieved by use of 
the reconstruction system utilizing the invention is 
provided in Fig. 9. Shown are the percentage of 
characters/words identified, errors made, and time spent' for. 
each phase of the algorithm. Of the total errors 71% were' 
made in reconstructing words not in--the dictionary, although 
over 87% of the characters in such words were correctly 
reconstructed. 

Several text samples as identified in Fig. 10 were 




encoded by the telephone keypad code and "reconstructed by 
the system. These samples were passed through a :ilter to 
spell out ordinal and cardinal numbers which occurred in the 
texts. The system performed remarkably well, correctly 
5 decoding more than 95% of all characters in all samples, and 
greater than 99% of the words used in selected speeches of 
President Clinton, which are considered the most 
representative of modern English of these text samples. 
Fig. 12 provides an analysis of the effectiveness of the 

10 different phases of operations. The unknown-word analysis 
accounts for the largest share of errors, which is not 
surprising since none of these words occur in the word list 
dictionary as assembled. Still, the system correctly 
identified over 87% of the characters in these words. 

15 Sentence disambiguation (i.e., selection of the correct word 
in cases of ambiguity) accounts for most of the computing 
time and almost all of the remaining errors, however the 
accuracy is still remarkably high. It is not believed that 
significantly higher accuracy rates are possible without 

20 actual comprehension of the text, which is clearly an 
unsolved and possible unsolvable problem. 

Fig. 11 provides an actual example provided by 
operation of a reconstruction system utilizing the 
invention. The Gettysburg Address was entered using a 

25 standard type of three-letter-per-key telephone keypad. 

After reconstruction by the system the Address included only 
the four reconstruction errors which are shown highlighted. 
With reference now to Fig. 12, there is provided a flow 
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chart useful in describing a sentence reconstruction method 
in- accordance with the invention. At 3 0 there is 
represented a sentence structure in the form of an English 
language sentence 4:o be entered. Fig. 12 particularly 
relates to the resolving of word ambiguities in English 
language sentence structures entered using single stroke 
activation of a standard type of telephone key set including 
number keys which are usable as text entry keys which each 
ambiguously represent a group of three letters. The method 
includes the following steps: 

(a) at step 32 providing first key stroke indicia each 
ambiguously representing a letter of a group of letters and 
second key stroke Indicia including indicia representing 
spaces between words; 

(b) at step 34 partitioning, by use of the second key 
stroke indicia, the first key stroke indicia into a sequence 
of word positions, each word position comprising a code 
block represented by at least one of the first key stroke 
indicia; 

(c) at step 36 accessing a database including a word 
list to identify for an individual word position a word 
group including alternative word choices formable from the 
letter groups represented by the step 34 code block for such 
word position; 

(d) at step 38 repeating step 36 for the sequence of 
remaining word positions to identify a corresponding word 
group including at least one word choice for each of a 
plurality of word positions; 
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(e) at step 40 utilizing a stored word use rule set 
representative of relative frequency of particular word 
usage in the selected language to derive (for the word group 
for one of the word positions) probability values for word 
choices for that word position; 

(f) at step 42 utilizing a stored language rule set 
representative of usage in the selected language to derive 
probability values for a sequencing of individual word 
choices for such word position relative to at least one word 
choice for an adjacent word position in the sentence 
structure, the language rule set including rules in at least 
one of the following categories (i) rules based on 
transitional probability of use of particular word 
sequences, and (ii) rules based on probability of relative 
positioning of words of particular word categories in a 
sentence structure; 

(g) at step 44 repeating steps 40 and 42 for any 
additional word positions having an associated word group 
including a plurality of alternative word choices; 

(h) at step 46 selecting, by use of the probability 
values derived in steps 40 and 42, one word from each word 
group for inclusion at a respective word position in the 
sentence structure; and 

(j) at step 48 using the words selected in step 46 to 
provide a reconstruction of the entered sentence structure 
in at least one of the following forms: a viewable display, 
a printout, a synthesized speech output. 

A sentence reconstruction method as described may 



desirably be ralplemented so that the language rule set 
utilized in step 42 includes rules in both of the categories 
(i) and (ii) and step 46 comprises selecting words for 
inclusion in the sentence structure based upon the highest 
relative probability values as derived in steps 4j0 and 42. 
Particularly for entry of a series of sentences, in step 32 
the second key stroke indicia additionally includes indicia 
representing a period delimiting the -sentence structure and 
step 34 additionally includes partitioning the word 
positions into at least one sentence. As discussed above, 
some word constructions may result in the absence of a match 
upon accessing of the stored word list. Such non-match 
cases can be addressed by including between steps 3 6 and 38 
an additional step as follows: 

(x) for each word position for which no word group 
including at least one word choice is identified in step 36, 
utilizing a stored word assembler unit to attempt to 
identify at least one of a suffix construction, a prefix 
construction and a combination word construction, and to 
thereby identify a word group including at least one word 
choice for such word position. The word assembler unit 
would typically be an additional program unit 18f stored in 
program memory portion 18. 

The invention has been described in the context of use 
with a particular text entry convention and application to 
entry of English language text via a telephone type keypad. 
With an understanding of the invention it will be apparent 
to skilled persons that the invention may be utilized in a 
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wide variety d^conventions and applicatrohs. Thus, the 
zero key or any other suitable convention can be employed 
for entry of the letters Q and Z for example. Also, text"" 
represented in computer memory may "be the result of many 
forms of text entry characterized by the potential for 
ambiguity as to which specific word is intended to be 
present at a particular word position in a sentence 
structure. Other examples and applications arising from use 
of ambiguous or overloaded keyboards include: 

- Text entry for languages with large alphabets. In 
languages such as Chinese, which use character sets with 
thousands of symbols, text may be entered using a 
conventional typewriter keyboard. Thus, the user may type 
in a phonetic pronunciation of each character and the 
resulting character ambiguity can be addressed by sentence 
reconstruction in accordance with the invention. 

- Minimizing hand motion on a QWERTY keyboard. To speed 
text entry or minimize required finger motion on a 
conventional QWERTY keyboard (implicated in physical stress 
ailments such as carpel-tunnel syndrome) a typist can type 
without moving fingers from the "home" row of keyboard keys. 
Thus, one home key is used to ambiguously represent the home 
key letter, as well as the letters on the adjacent keys " 
above and below the home key. With use of the invention on 
a word- processor, for example, the ambiguous entered text 
would be reconstructed and displayed to the typist, who 
could then correct any reconstruction errors. 

- Customized keyboards for text entry. Improved, 
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simplified ancr faster keyboards can be p^vided for 
applications such as court stenography, one-handed chord 
keyboards, or smaller special purpose keyboards for portable 
computers. Resulting ambiguities In entered text can then 
be addressed by sentence reconstruction using the invention. 

- Keyboards for the disabled. Severely handicapped 
persons may not have enough physical control or range of 
movement to use a full keyboard. A special keyboard may 
include only six to ten keys. Entry of text with such 
simplified keyboards will result in ambiguities addressable 
by use of the present invention. Text can thus be entered 
with a significantly reduced range of movement. 

Other examples of activities resulting in word 
ambiguities to which the invention can be applied include 
the following. Voice recognition of spoken or dictated 
sentences produces computerized storage of spoken words 
which may be incorrect or ambiguous because of an inability 
of the voice recognition system to accurately select the 
correct word between words which sound alike and for a 
variety of other reasons such as mispronunciation. If a 
computerized system is adopted to translate from a foreign 
language into English, for example, certain of the foreign 
words may have several potentially correct English 
counterparts, with the translation system unable to 
accurately choose among the English -alternatives. Text 
input in hurried or inexpert fashion via a standard 
typewriter-type keyboard may result in entered text ' 
containing enough errors that for particular word positions 




the entered ^i^tters could represent more than one word. In 
these and other applications, there can exist ambiguities as 
between which of several possible words is intended to be^ 
presented at a particular word position. In the telephone 
5 keypad text entry example, for a particular word position 
the word group is determined by available permutations of 
the three telephone key letters for each letter position of 
a word. Other word association criteria for other 
applications include phonetic word association, similarly 
10 spelled word association, definitional alternatives of 
translated words, etc. In view of the described 
capabilities of systems and methods in accordance with the 
invention, it will be apparent that entry unit 10 of Fig. 1 
can utilize or implement any one of a variety of sentence 
15 structure input techniques or configurations effective to 
result in storage in memory portion 16 of a sentence 



structure with one or more word ambiguities susceptible to 
sentence structure reconstruction methods and systems 



utilizing the invention. Further, while the invention is 



20 



described with reference to English as the selected 
language, it can also be applied for use with other 



languages. 



25 



For purposes of example, particular applications of the 
invention may include communication with hearing impaired 
persons by text entry from any telephone-type keypad without 
telephone operator intervention. The hearing impaired 



person could use a full keyboard and computer screen, with 



text-to-speech synthesizer feedback enabling the telephone 
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keypad user t^hear results of his keypaa input. For E-mail 
without a termiral a telephone user would be enabled to 
write E-mail ant. receive E-mail messages via a speech 
synthesizer. In voice response systems limitations 
5 currently require some user inquiries to be recorded for 

later action. With the invention a user's name, address and 
other information could be entered and reconstructed with 
sufficient fidelity to enable requested literature to be 
mailed without human intervention, for example. 
10 While there have been described the currently preferred 

embodiments of the invention, those skilled in the art will 
recognize that other and further modifications may be made 
without departing from the invention and it is intended to 
claim all modifications and variations as fall within the 
15 scope of the invention. 



-31- 



